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Abstract 

We study a class of program schemes, NPSB, in which, aside from basic assignments, non- 
deterministic guessing and while loops, we have access to arrays; but where these arrays 
are binary write-once in that they are initialized to ‘zero’ and can only ever be set to ‘one’. 
We show, amongst other results, that: NPSB can be realized as a vectorized Lindstrom 
logic; there are problems accepted by program schemes of NPSB that are not definable in 
the bounded-variable infinitary logic all problems accepted by the program schemes of 

NPSB have a zero-one law; and on ordered structures, NPSB captures the complexity class 
IjNP . The class of program schemes NPSB is actually the union of an infinite hierarchy 
of classes of program schemes. When we amend the semantics of our program schemes 
slightly, we find that the classes of the resulting hierarchy capture the complexity classes E? 
(where i > 1) of the Polynomial Hierarchy PH. Finally, we give logical equivalences of the 
complexity-theoretic question ‘Does NP equal PSPACE?’ where the logics (and classes of 
program schemes) involved define only problems with zero-one laws (and so do not define 
some computationally trivial problems). 


1 Introduction 

Finite model theory is essentially the study of logical definability over finite structures. 
An important sub-area of finite model theory is the relationship between the logical 
definability of classes of finite structures and computational complexity theory; that is, 
descriptive complexity theory. This relationship is best exemplified by Fagin’s seminal 
result that a problem, i.e., a class of finite structures over the same signature, can 
be defined by a sentence of existential second-order logic if, and only if, the problem 
(or, to be more precise, an encoding of it) can be accepted by a polynomial-time 
non-deterministic Turing machine |J. 

In two recent papers [|[ |3(J, we have studied logical definability in a slightly 
different context in that we have worked with classes of program schemes as opposed 
to more traditional logics. Program schemes are more computational in flavour than 
are formulae of traditional logics yet they remain amenable to logical manipulation. 

* Supported by EPSRC Grants GR/K 96564 and GR/M 12933. 
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The concept of a program scheme originates from the 1970’s with work of, for example, 
Constable, Friedman, Gries, Hewitt and Paterson |5, 10, 2t|; and complexity-theoretic 
considerations of such program schemes were subsequently studied by, for example, 
Harel, Jones, Muchnik, Peleg, Tiuryn and Urzyczyn 0 00- Our analysis of 
program schemes differs from what has gone before in that we are always concerned 
with finite structures (and not infinite ones as was often the case previously) and we do 
not assume that the elements of our finite structures are necessarily linearly ordered. 
Our studies have exhibited an extremely close relationship between certain classes of 
program schemes and the more traditional logics from descriptive complexity theory, 
and our methodology has enabled us to prove new results extending known results 
from descriptive complexity theory. For example, in jij we defined a hierarchy of 
classes of program schemes, NPSS, where these program schemes involve assignments, 
while loops and non-deterministic guessing and have access to a stack, before showing 
that this hierarchy is proper and (the union of it) has exactly the same expressive 
power as path system logic (a logic previously considered on ordered finite structures 
| [Fb>t p| and which has been shown to be none other than stratified fixed point logic 
and stratified Datalog on the class of all finite structures 00)- Furthermore, 
our hierarchy result translates to a strong hierarchy result about path system logic, 
a result which was previously unknown and not immediately derivable using known 
results and techniques from descriptive complexity theory (see [Q for more details). 

As well as equating classes of program schemes with more mainstream logics from 
descriptive complexity theory, we have also shown how naturally defined classes of 
program schemes result in logics which previously have not arisen in descriptive com- 


we con- 


plexity theory yet which have interesting properties. For example, in 130 
sidered a hierarchy of classes of program schemes, NPSA, similar to the program 
schemes of NPSS but with arrays replacing the stack (with the levels of the hierarchy 
denoted by NPSA(*), for i > 1). We showed that (the union of) this hierarchy NPSA 
can be realized as a vectorized Lindstrom logic (many of the mainstream logics of 
descriptive complexity theory are explicitly defined as vectorized Lindstrom logics: 
see Q] ), and that there are program schemes in this hierarchy (even in the first level) 
which accept problems not definable in the bounded-variable inhnitary logic C (a 
fundamental and encompassing logic in finite model theory: see [7j). However, we also 
show that, like any problem definable in every problem accepted by a program 

scheme of NPSA has a zero-one law. We remark that on ordered finite structures 
the class of program schemes NPSS captures the complexity class P and the class of 
program schemes NPSA captures the complexity class PSPACE (and in both cases 
the underlying hierarchy collapses to the first level). 

In this paper, in an attempt to ‘bridge the gap’ between the polynomial-time 
world of NPSS and the polynomial-space world of NPSA, we modify the program 
schemes of NPSA so that all arrays are ‘binary write-once’ in the sense that all array 
elements are initially set at ‘zero’ and the only modification to any array element 
allowed is to set it to ‘one’. The resulting class of program schemes is denoted NPSB, 
with the levels of the underlying hierarchy being NPSB(j), for i > 1. We show that 
NPSB retains some of the properties of NPSA: like NPSA, NPSB can be realized as 
a vectorized Lindstrom logic, and NPSB(l) contains problems not definable in 
(every problem accepted by a program scheme of NPSB trivially has a zero-one law 
as NPSB is a sub-class of NPSA); but whereas both NPSA and NPSA(l) capture 
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PSPACE on ordered structures, NPSB(l) captures NP and NPSB captures L NP , 
with the NPSB hierarchy collapsing to NPSB(3). 

We then amend the semantics of the class of program schemes NPSB in that we 
allow the current values of arrays to be ‘passed across’ to other program schemes 
(appearing as, what amounts to, subroutines in the main program scheme) in a com¬ 
putation (hitherto, the semantics has only allowed the current values of variables to 
be passed across). We denote the class of program schemes with this amended seman¬ 
tics as NPSB P (the subscript reflects the polynomially many values passed across to 
the component program schemes). We show that on the class of all finite structures, 
NPSB p (2i — 1) = NPSB p (2i) and captures £ p , i.e., the ith level of the Polynomial 
Hierarchy PH, for i > 2; and so NPSB P captures PH itself. 

Finally, we compare the relative expressibilities of the classes of program schemes 
NPSA(l) and NPSB(l). Recall that any problem accepted by any program scheme 
of these classes has a zero-one law; and so there are complexity-theoretic trivial 
problems (like the problem consisting of all those structures of even size over some 
fixed signature) not accepted by such program schemes. We show that NPSA(l) = 
PSPACE D £ XT and NPSB(l) = NP D EXT, where EXT is the class of problems 
closed under extensions (and thus every non-trivial problem in EXT has a ‘one-law’); 
and thus that the question of whether NP is equal to PSPACE is equivalent to the 
question of whether the two classes of program schemes NPSB(l) and NPSA(l) accept 
the same class of problems (and also equivalent to whether two particular fragments 
of two vectorized Lindstrom logics are equally expressive) on the class of all finite 
structures. 

This paper is structured as follows. In the next section, we outline the definitions 
relating to this paper before we define our classes of program schemes in Section 3. In 
Section 4, we identify the class of program schemes NPSB with a vectorized Lindstrom 
logic, before looking at our program schemes and logics on ordered structures in 
Section 5, where we also amend the original semantics of our program schemes. In 
Section 6, we compare the relative computational power of the classes of program 
schemes NPSB(l) and NPSA(l) (on the class of all finite structures) before presenting 
our conclusions and directions for further research in Section 7. 


2 Preliminaries 

The main reference texts for the basic concepts, notions and results of finite model 
theory are ]t], [h|] and it is to these books that we refer the reader for additional 
information. However, we give the definitions relating to this paper in some detail 
below as it is often the case that more generality is required than in ]^, [n| and we 
also need some notions not defined in those texts. 

Ordinarily, a signature a is a tuple (R±, ... , R r , Ci,..., C c ), where each R, is a 
relation symbol, of arity a t , and each Cj is a constant symbol. First-order logic over 
the signature a, FO(cr), consists of those formulae built from atomic formulae over a 
using A, V, -i, V and 3; and FO = U{FO(er) : a is some signature}. 

A finite structure A over the signature a, or a-structure, consists of a finite uni¬ 
verse or domain |A| together with a relation Ri of arity cq, for every relation symbol 
Ri of a, and a constant Cj G |A|, for every constant symbol Cj (by an abuse of 
notation, we do not usually distinguish between constants and relations, Cf- and Rf, 
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and constant and relation symbols, Cj and Rj). A finite structure A whose domain 
consists of n distinct elements has size n, and we denote the size of A by A| also 
(this does not cause confusion). We only ever consider finite structures of size at least 
2, and the set of all finite structures of size at least 2 over the signature cr is denoted 
STRUCT(ct). A problem over some signature cr consists of a subset of STRUCT(cr) 
that is closed under isomorphism; that is, if A is in the problem then so is every 
isomorphic copy of A. Throughout, all our structures are finite. 

The class of problems defined by the sentences of FO is denoted by FO also, 
and we do likewise for other logics. It is widely acknowledged that, as a means for 
defining problems, first-order logic leaves a lot to be desired; especially when we 
have in mind developing a relationship between computational complexity and logical 
definability. For example, every first-order definable problem can be accepted by a 
log-space deterministic Turing machine (where structures are encoded as strings) yet 
there are problems in the complexity class L (log-space) which can not be defined in 
first-order logic (one such being the problem consisting of all those structures, over 
any signature, that have even size). Consequently, a number of methods have been 
developed so as to increase definability. 

One method is to extend first-order logic using a vectorized sequence of Lindstrom 
quantifiers corresponding to some problem f2; or, as we prefer, an operator 12 for short. 
Suppose that O is over the signature cr, where cr = (i? 1; ..., R ri C i,..., C c ), as above. 
The logic (±12)*[FO] consists of those formulae built using the usual constructs of 
first-order logic and also the operator 12, where the operator f2 is applied as follows. 

• Suppose that V’i( x i,y), • • •, ^v( x r, y) are formulae of (±f2)*[FO] such that: 

— each Xj is a fcaj-tuple of distinct variables, for some fixed k > 1; 

— y is an ?n-tuple of distinct variables, for some m > 0, each of which is 
different from any variable of xi,..., x r ; and 

— all free variables of any ifi are contained in either x, or y. 

• Suppose that di,..., d c are fc-tuples of variables and constants (which need not 
be distinct). 

• Then: 

I2[Axi^(xi,y),... ,x r ^ r (x r ,y)](di,... ,d c ) 

is a formula of (±f2)*[FO] whose free variables are the variables of y together 
with any other variables appearing in di,..., d c . 

If $ is a sentence of the form f2[Axi^>i(xi),..., x r ?/v(x r )](di,... ,d c ), as above, over 
some signature o' then we interpret <f> in a cr'-structure A as follows (note that as <I> 
is a sentence, the variables of y are absent and the tuples di,..., d c , which are only 
there if there are constant symbols in cr, consist entirely of constant symbols of a'). 

• The domain of the cr-structure <I>(Al) is |A| fc . 

• The relation Ri of <I>(Al) is defined via: 

— for any u £ |$(Al)| ai = \A\ kai , Ri( u) holds in d>(Al) if, and only if, 
holds in A. 
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• The constant C :j of d>(.A) is defined via: 

— Cj is the interpretation of the tuple of constants dj in A. 

We define that A |= $ if, and only if, *I>(„4) £ 17 (the situation where $ has free 
variables is similar except that <f> is interpreted in expansions of (/-structures by 
an appropriate number of constants). We call logics such as (±17)* [FO] vectorized 
Lindstrom logics. We shall also be interested in fragments of vectorized Lindstrom 
logics where: the formulae are such that the operator f l does not appear within the 
scope of a negation sign, namely f7* [FO] (the positive fragment of (±17)* [FO]); and 
further the formulae are such that there are no nestings of the operator 17, namely 
17 1 [FO] (the positive unnested fragment of (±17)* [FO]). 

It can be the case that (a fragment of) a vectorized Lindstrom logic (±17)*[FO] 
has a very straightforward normal form; a normal form which obviates the need to 
nest applications of the operator 17 and which tells us something about the ‘degree of 
difficulty’ of the particular problem 17 with respect to the class of problems defined by 
the sentences of the logic. For example, suppose that every problem in (a fragment of) 
(±17)*[FO] can be defined by a sentence of the form 17[Axi^i(xi),... ,x r ?/v(x r )](di, 

.... d c ), as above, except where each tpi is quantifier-free first-order. Then we say that 
the problem 17 is complete for (the fragment of) (±17)* [FO] via quantifier-free first- 
order translations. This is directly analogous to completeness for some complexity 
class via some resource-bounded reduction: in fact, as we shall see, such normal forms 
can often yield very strong complexity-theoretic completeness results. 

Vectorized Lindstrom logics have been studied quite extensively in finite model 
theory and a whole range of complexity classes have been captured , i.e., characterized, 
by vectorized Lindstrom logics (see, for example, jlj, [Is], |2^, and the references 
therein). However, some (though not all) of these characterizations only hold in the 
presence of a built-in successor relation. Consider some vectorized Lindstrom logic 
(±17)*[FO]. To say that this logic has a built-in successor relation , which we denote 
by (±I7)*[FO s ], means that no matter which signature o' we are working over, there is 
always a binary relation symbol succ and two constant symbols 0 and max available 
(none of which is in o') such that succ is always interpreted as a successor relation 
with least element 0 and greatest element max in any /-structure. That is, for any 
/-structure of size n, succ is always of the form {(0, u\), {u\, u 2 ), ■ ■ ■, {u n - 2 , max )}, 
where the elements of {0, iti, U 2 , ■ ■ ■, u n - 2 , max} are distinct. However, there is a 
further semantic stipulation on the sentences of (±f7)*[FO s ]: we only consider as well- 
formed those sentences for which the interpretation in any structure is independent 
of the particular successor relation chosen. For example, define the problem TC over 
the signature 02 ++ = ( E, C, D ), where if is a binary relation symbol and C and 
D are constant symbols, as consisting of all those (72+-|_-structures for which, when 
considered as digraphs in the natural way, there is a directed path from the vertex C to 
the vertex D. Then the following sentence is a well-formed sentence of (±TC)*[FO s ] 
(as satisfiability in a given structure is invariant with respect to succ): 

TC[A(x!,x 2 ), (yi,y 2 )(xi = OAyi = max A succ(x 2 , y 2 )) 

V(xi = max A yi = 0 A succ(x 2 , 2 / 2 ))](0,0, max, max), 

and it defines the problem over the empty signature consisting of those structures of 
even size. (Note that in Q , the mechanism by which a successor relation is introduced 
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into a logic is slightly different from how we have described here in that only problems 
on ordered structures are ever considered: see [?]. Nevertheless, the two approaches 
essentially amount to the same thing and we shall refer to the two mechanisms inter¬ 
changeably). Note also that other relations can be built into logics in the same way 
as is a successor relation; or even just two distinct constants can be built in.) 

From a logical perspective, there is a problem with our built-in successor relation in 
the following sense. Given a sentence of first-order logic in which the relation symbol 
succ appears (and in which other constant and relation symbols might appear), it 
is actually undecidable as to whether the sentence is invariant with respect to succ. 
That is, there does not exist an effective enumeration of the well-formed sentences 
of FO s . Given this fact, it is highly debatable as to whether any ‘logic’ (±fl)*[FO s ] 
should really be called a logic; and it is an open question currently occupying much 
research activity as to whether there actually exists a logic capturing the complexity 
class P (polynomial-time), or indeed any complexity class contained in NP (non- 
deterministic polynomial-time), where by ‘contained in’ we really mean ‘contained in 
but expected to be different from’ (as NP itself can be captured by a logic, one such 
being existential second-order logic). The reader is referred to |2(J for more details 
on this and related points. (Notwithstanding the above discussion, we still refer to 
(±fl)*[FO s ] as a logic on the grounds of convenience.) 

Theorem |Ij, below, is an example of a normal form result. Define the problem 
CUB, over the signature cr 2 = (E), where E is a binary relation symbol, as follows. 

CUB = {Q £ STRUCT(ct 2 ) : the graph Q has a subset of edges inducing a 

regular subgraph of degree 3} 

(think of a 02 -structure Q as encoding an undirected graph via: ‘there is an edge 
(u,v) if, and only if, u ^ v and E(u,v) V E(v,u) holds in Q’). We shall need the 
following result later on. 

Theorem 1 [f28| The complexity class NP is identical to the class of problems defined 
by the sentences of CUB*\FO s 1; and any problem in NP can be defined by a sentence 
of CUB 1 [FO s ] of the form: 

CUB [Ax, y^(x, y )], 

where |x| = |y| = k, for some k > 1, and if is a quantifier-free formula of FO s . Hence, 
CUB*[FO s \ = CUB 1 [FO s ] = NP and CUB is complete for NP via quantifier-free 
first-order translations with successor. □ 

Note that Theorem [t] subsumes the ‘traditional’ known complexity-theoretic result 
that CUB is complete for NP via log-space reductions (a result attributed to Chvatal 
in Q). 


3 Program schemes 

Program schemes are more ‘computational’ means for defining classes of problems 
than are logical formulae. A program scheme p € NPSA(l) involves a finite set 
{xi,X 2 , ■ ■ ■, Xk} of variables, for some k > 1, and is over a signature a. It consists of 
a finite sequence of instructions where each instruction, apart from the first and the 
last, is one of the following: 
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• an assignment instruction of the form ‘x; := y\ where i £ {1,2, ...,fc} and 
where y is a variable from {xi, X 2 ,..., x*,}, a constant symbol of a or one of the 
special constant symbols 0 and max which do not appear in any signature; 

• an assignment instruction of the form ‘x* : = A\y\,y 2 , ■ ■ ■, yd]’ or ‘A[yi, t/ 2 , • • •, 
Vd] ■ ~ Vo\ for some i £ {1, 2,..., fc}, where each yj is a variable from {xi, X 2 , 
... ,Xfc}, a constant symbol of a or one of the special constant symbols 0 and 
max which do not appear in any signature, and where A is an array symbol of 
dimension d; 

• a guess instruction of the form 'guess xf, where i £ {1, 2,..., fc}; or 

• a while instruction of the form ‘while tp do cci; < 22 ; •..; a q od ! , where ip is 
a quantifier-free formula of FO(<r U {0, max}) whose free variables are from 
{xi, X 2 , ■ ■ •, Xfc}, and where each of «i, 02 , ■ ■ •, ce q is another instruction of one 
of the forms given here (note that there may be nested while instructions). 

The first instruction of p is 1 input (xi, X 2 ,..., x;)’ and the last instruction is ‘output 
(xi, X 2 , • • •, Xi)\ for some l where 1 < l < k. The variables xi, X 2 ,..., X; are the input- 
output variables of p , the variables xj+i,Xz+ 2 ,... , Xfc are the free variables of p and, 
further, any free variable of p never appears on the left-hand side of an assignment 
instruction nor in a guess instruction. Essentially, free variables appear in p as if they 
were constant symbols. 

A program scheme p £ NPSA(l) over a with t, free variables, say, takes a a- 
structure A and t additional values from \A\, one for each free variable of p, as input; 
that is, an expansion A' of A by adjoining t additional constants. The program 
scheme p computes on A' in the obvious way except that: 

• execution of the instruction ‘guess xf non-deterministically assigns an element 
of |A| to the variable xp 

• the constants 0 and max are interpreted as two arbitrary but distinct elements 
of \A\; and 

• initially, every input-output variable and every array element is assumed to have 
the value 0. 

Note that throughout a computation of p , the value of any free variable does not 
change. The expansion A' of the structure A is accepted by p 1 and we write A' |= 
p , if, and only if, there exists a computation of p on this expansion such that the 
output-instruction is reached with all input-output variables having the value max 
(in particular, some computations might not be terminating). We can easily build 
the usual ‘if’ and ‘if-then-else’ instructions using while instructions (see, for example, 
[pd|). Henceforth, we shall assume that these instructions are at our disposal. 

We want the sets of structures accepted by our program schemes to be problems, 
i.e., closed under isomorphism, and so we only ever consider program schemes p where 
a structure is accepted by p when 0 and max are given two distinct values from the 
universe of the structure if, and only if, it is accepted no matter which pair of distinct 
values is chosen for 0 and max. This is analogous to how we build two constant 
symbols into a logic. Furthermore, we can build a successor relation into the program 
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schemes of NPSA(l) so as to obtain the class of program schemes NPSA s (l). As with 
our logics, we write NPSA(l) and NPSA s (l) to also denote the class of problems 
accepted by the program schemes of NPSA(l) and NPSA s (l), respectively (and do 
likewise with other classes of program schemes). 

We have two remarks. First, our notation NPSA(l) reflects the fact that NPSA(l) 
is the first level of an infinite hierarchy of classes of program schemes, as we shall 
see presently. Second, as the definition of our class of program schemes NPSA(l) 
stands, we do not know whether the program schemes in this class can be recursively 
enumerated. However, we are prepared to live with this (possible) inconvenience as 
we could have defined the program schemes of NPSA(l) to be devoid of the constant 
symbols 0 and max and be such that initially every variable and array element is 
non-deterministically assigned the same element of the input structure (this would 
result in the same class of problems). Such a definition would mean that the class of 
structures accepted by such a program scheme is always closed under isomorphism 
(hence, recursive enumerability would not be an issue). However, there are three real 
reasons for having the constant symbols 0 and max in our program schemes. First, 
we can use 0 to initialize all variables and arrays, with the result that we never have 
to worry about whether an assignment involves an uninitialized variable or array 
element. Second, having two distinct constants around is useful when it comes to 
programming. Third, we shall soon use the constant symbol max to enable us to 
study ‘binary write-once’ arrays (that is, arrays where the elements can only be set 
to max and thereafter remain unchanged). 

Henceforth, we think of our program schemes as being written in the style of 
a computer program. That is, each instruction is written on one line and while 
instructions (and, similarly, if and if-then-else instructions) are split so that ‘while 
ip do’ appears on one line, ‘op’ appears on the next, ‘ 02 ’ on the next, and so on (of 
course, if any a* is a while, if or if-then-else instruction then it is split over a number of 
lines in the same way). The instructions are labelled 1,2, and so on, according to the 
line they appear on. In particular, every instruction is considered to be an assignment, 
a guess or a test. An instantaneous description (ID) of a program scheme on some 
input consists of a value for each variable, the number of the instruction about to be 
executed and values for all array elements. A partial ID consists of just a value for 
each variable and the number of the instruction about to be executed. One step in a 
program scheme computation is the execution of one instruction, which takes one ID 
to another, and we say that a program scheme can move from one ID to another if 
there exists a sequence of steps taking the former ID to the latter. 

As we hinted at above, the class of program schemes NPSA(l) is but the first level 
of an infinite hierarchy of program schemes. Suppose that we have defined a class of 
program schemes NPSA(2?n — 1), for some m > 1, and that any program scheme has 
associated with it: a set of input-output variables; a set of free variables; and a set of 
bound variables (this is certainly the case when m = 1, where the associated set of 
bound variables is empty). 

Definition 2 Let the program scheme p £ NPSA(2to — 1) be over the signature a. 
Suppose that p has: input-output variables xi,X 2 ,Xk', free variables Xk+i,Xk+2 , 
..., Xk+s ; and bound variables Xk + S +1 , Xk+ S + 2 , • • •, Xk+ s +t ■ Let xq, x, 2 ,..., Xi p be 



free variables of p, for some p (and so k + 1 < i\ < *2 < ■ ■ ■ < i P < k + s). Then: 

VXi 2 • • • '^•KipP 

is a program scheme of NPSA(2m), which we denote by p ’, with: no input-output 
variables; free variables those of {xk+i, Xk+2, ■ ■ ■ ,%k+s} \ {%ii,%i 2 , ... ,Xi p }\ and the 
remaining variables of {x±,X 2 , ■ ■ ■, Xk+ s +t} as its bound variables. 

A program scheme such as p ' takes expansions A! of cr-structures A by adjoining 
s — p constants as input (one for each free variable), and p’ accepts such an expansion 
A! if, and only if, for every expansion A" of A' by p additional constants (one for each 
variable ay,, for j G {1, 2,... ,p}), A" \= p (the computation on such an expansion 
A" always starts with the arrays initialised to 0). □ 


Definition 3 A program scheme p' € NPSA(2?n — 1), for some m > 2, over the 
signature a, is defined exactly as is a program scheme of NPSA(l) except that the 
test in any while instruction is a program scheme p G NPSA(2m — 2). The bound 
variables of p' consist of the bound variables of any test in any while instruction; all 
free variables in any test in any while instruction are input-output or free variables of 
p'; and there may be other free and input-output variables (appearing in p' at the ‘top 
level’ but not in any test). Of course, any free variable never appears on the left-hand 
side of an assignment instruction or in a guess instruction (at the ‘top level’). 

Suppose that a program scheme p' G NPSA(2to — 1) has s free variables. Then 
it takes an expansion A' of a cr-structure A by adjoining s constants as input and 
computes on A! in the obvious way; except that when some while instruction is en¬ 
countered, the test, which is a program scheme p G NPSA(2m — 2), is evaluated 
according to the expansion of A! by the current values of any relevant input-output 
variables of p' (which may be free in p). In order to evaluate this test, the arrays 
associated with p are initialized at 0 and when the test has been evaluated the com¬ 
putation of p' resumes accordingly with the values of its arrays and input-output and 
free variables being exactly as they were immediately prior to the test being evalu¬ 
ated. In particular, array values can not be ‘passed across’ in the evaluation of tests: 
the values of variables can be but they are never amended in the process. □ 

Consequently, we obtain a hierarchy of classes of problems: 

NPSA(l) C NPSA(2) C ... C U{NPSA(z) :i = 1,2,...} = NPSA 

(we use the inclusion relation between consecutive classes because this is how they 
are related as classes of problems). It is easy to see that, for one thing, FO C NPSA. 

In this paper, we are primarily interested in some sub-classes of program schemes 
of NPSA, namely the sub-classes NPSB(i), for i = 1,2,..., and the union of these 
classes NPSB, where the only allowed assignment instructions with an array element 
on the left-hand side are of the form A[x i, X 2 , ..., Xk\ : = max ; that is, the only values 
array elements can have are 0 and max , and once an array element is set to max then 
it remains at max thereafter (the notation reflects the binary nature of these arrays). 
Obviously, NPSB(i) C NPSA(i), for all * = 1,2,...; and NPSB C NPSA. 

Results concerning the program schemes of NPSA have already been obtained, 
and some of these results relevant to this paper are stated below. A problem fl, over 
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some signature a and where the domain of any u-structure of size n is taken to be 
{1, 2,..., n}, for which the function f(n), defined as the number of structures in 
of size n divided by the number of cr-structures of size n, is such that the limit as n 
tends to infinity exists and is 0 or 1 is said to have a zero-one law. 

Theorem 4 |Q 

(z) There exists a problem Q a , involving reachability in Petri nets, for which 

NPSA = (±n o )*[fp|, 

and the class of problems NPSA has a zero-one law. 

( ii) There is a quantifier-free first-order translation with 2 constants from any prob¬ 
lem in NPSA( 1) to the problem f2 a ; and so f l a is complete for NPSA(1) via 
quantifier-free first-order translations with 2 constants. 

(in) The problem CUB is in NPSA(1) but not definable in the logic C‘^ ou . 

(iv) In the presence of a built-in successor relation, the hierarchy NPSA s collapses 
to the first level, NPSA s ( 1), and captures the complexity class PSPACE. □ 

It is worth mentioning the role of the logic in finite model theory. This 

logic is an important logic for a number of reasons, one of which is that it subsumes 
many of the logics from finite model theory (including transitive-closure logic, least 
fixed point logic and partial fixed point logic) in that these logics can be realized as 
fragments of . Furthermore, C^ ow has a zero-one law and so any logic subsumed 
by Cfnu has a zero-one law. It is particularly interesting that NPSA(l) (and so also 
NPSA) can not be realized as a fragment of (as CUB is a problem in NPSA(l) 
that is not in a result proven in pg|]). 

In the absence of arrays, when the resulting class of program schemes is denoted 
NPS, and additionally in the presence of a stack, when the resulting class of program 
schemes is denoted NPSS, there are results analogous to parts (i), (ii) and (iv) 
of Theorem [| (see i) in that: both NPS and NPSS can be realized as vectorized 
Lindstrom logics so that the problems corresponding to the operators involved in these 
logics are complete for NPS(l) and NPSS(l) via quantifier-free first-order translations 
with 2 constants; and on ordered structures, the complexity classes captured are 
NL (non-deterministic log-space) and P, respectively. However, unlike NPSA, both 
NPS and NPSS can be realized as fragments of jC£, w . Furthermore, the underlying 
hierarchies of NPS and NPSS are proper at every level (even if we restrict to problems 
only involving trees) whereas, as we shall affirm later, all that is known as regards 
NPSA is that NPSA(l) c NPSA(2) c NPSA(3). 

4 Partitioned Petri nets 

We begin by describing a generalization of the digraph reachability problem to a 
scenario where the moves between nodes depend upon the availability and utilization 
of external resources. We first describe the basic decision problem in an everyday 
fashion before we consider a manifestation of it as a class of structures over a given 
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signature and see how this problem is related to computation in the program schemes 
of NPSB(l). 

Consider the following scenario. We are given a direct graph G = (V,E), where 
\V\ = n , with a source vertex source and a sink vertex sink , but where each edge 
is labelled with a (possibly empty) set of labels with each label being of one of the 
following forms: 

• ‘user resource r, is unused’; 

• ‘system resource Sj is available’; 

• ‘user resource rj is unused and this move uses this resource but makes the system 
resource Sj available (if it wasn’t available previously)’. 

There is a polynomial number of different user resources {?y : i = 1,2,... ,p(n)}, 
which are either in the state ‘used’ or the state ‘unused’; and a polynomial number of 
system resources {s, : i = 1 , 2 ,..., g(n)}, which are either in the state ‘available’ or 
the state ‘unavailable’ (for some polynomials p and q). A move in the digraph from 
vertex u to vertex v via the edge (it, v ) can only be made if either no labels label 
the edge ( u , v ) or at least one of the labels labelling the edge (u, v) is satisfied (with 
a resulting change in the state of a user resource, and possibly a system resource, if 
the label is of the third type). The question we ask is, given the initial state where 
all user resources are unused and no system resources are available, is it possible to 
move from source to sink in our given environment? That is, can the user use his or 
her resources wisely so as to enable a traversal in the digraph from the source to the 
sink? 

Note that whether a move can be made depends only on certain predicates involv¬ 
ing the states of the resources: for example, there are no moves dependent upon the 
state of a user resource being ‘used’ or of a system resource being ‘unavailable’. The 
situation is as it is as this decision problem arises naturally from our consideration of 
our program schemes; but we comment further on this problem and related problems 
in the Conclusion. 

We encode the above decision problem as a problem, i.e., class of finite structures, 
involving Petri nets. Our encoding is natural and has certain properties which we shall 
utilize later. The reader is referred to || for the basic notions and concepts relating to 
Petri nets (this reference also gives details of numerous complexity-theoretic results 
concerning fundamental problems in Petri nets). 

Definition 5 Define df, = (P, (). T\, 7 2 , T 3 , C. I)) where P, Q. T), T 2 and T 3 are 
relation symbols of arities 1, 1, 2, 3 and 4, respectively, and C and D are constant 
symbols. Let V be a db-structure. We can think of the elements of \V\ as being the 
places of a Petri net and the relations P and Q as describing two partitions of these 
places. We can think of: 

• the relation Tf as describing the set of transitions 

{({w}, {i>}) : u,v £ P and T\{u , v ) holds}; 

• the relation T 2 as describing the set of transitions 

{({m, i}, {u, i}) : u,v £ P,i ^ P,i £ Q and T 2 [u, v, i) holds} 
u {({ u !.?}; {<bj}) : u,v £ P,j £ P,Q and T 2 (u,v,j) holds}; 
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and 


• the relation T 3 as describing the set of transitions 

{(I'M}, {v,j}) : u, v <E P,i,j fL P,i € Q,j £ Q and T 3 (u,v,i,j) holds}. 


Furthermore, the initial marking of our Petri consists of the place C and the places 
not in P but in Q. We define the problem fib as 

{P € STRUCT(crb) : there is a marking reachable from the initial marking 
in which there is at least one token on the place D}. □ 


Note that the transitions encoded within a ovstructure V are of one of four types, 
as depicted in Fig. 1, and that the relations Ti, To. and T 3 of V might have additional 
tuples in them that do not affect how we think of P as a Petri net. 






T | transitions 


T-, transitions 


I 3 transitions 


Figure [[| The different types of transitions. 


With reference to our decision problem presented earlier, it should be clear that: 
the places in P correspond to the vertices V of our digraph G = (V,E), with C 
corresponding to the source vertex and D the sink vertex; the places not in P but in 
Q correspond to the user resources; and the places not in P and not in Q correspond 
to the system resources (henceforth, we shall use this terminology to describe the 
places of our Petri net). Additionally, the transitions described by Ti correspond to 
edges of E with no labels; the transitions described by T 2 yield edges labelled with 
labels of the form ‘user resource r, is unused’ and ‘system resource Sj is available’; 
and the transitions described by T 3 yield edges labelled with labels of the form ‘user 
resource jy is unused and this move uses this resource but makes the system resource 
Sj available’. We interpret a user resource as being in the state ‘unused’, if there is a 
token on it, and as being in the state ‘used’ otherwise (such places only ever have at 
most one token on them). It may be the case, in a reachable marking, that a system 
resource has more than one token on it. However, tokens can not be removed from 
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such places. Thus, it is only ever important as to whether a system resource has no 
tokens on it, when we think of it being in the state ‘unavailable’, or at least one token 
on it when, we think of it being in the state ‘available’. 

The proof of the following theorem is similar to those in |ioj although there are 
additional complications caused by only having assignments which set array values to 
max. 

Theorem 6 There is a quantifier-free first-order translation with 2 constants from 
any problem in NPSBfil ) to the problem f Hence, fib is complete for NPSB( 1) via 
quantifier-free first-order translations with 2 constants. 


Proof Let p be a program scheme of NPSB(l) over some signature a in which if 
and if-then-else instructions might occur. W.l.o.g., we may assume that array symbols 
only appear in assignment instructions, that there is only one array symbol, B , and 
that this array symbol has dimension d > 1. We assume that the variables involved 
in p are xi, x 2 ,..., Xk- 

Let A be a cr-structure of size n > 2. An element u = (uq, u\, ..., Uk) of 
{1,2,...,/} x |^4j fc encodes a partial ID of p on input A via: a computation of p 
on A is about to execute instruction it 0 and the variables X \, x 2 ,..., Xk currently 
have the values U\, U 2 , ■ ■ ■, it*,, respectively. Henceforth, we identify partial IDs of p 
and the elements of {1, 2 ,..., /} x |A| fc . 

We now build a Petri net P, as in Definition |B|, using p and A. Our Petri net 
V has a set of places consisting of the set {1,2 ,...,/} x |A| fc in union with the set 
{wo,w m : w G |A| d }. The sets of places P and Q are 

P = {1, 2,...,/} x |A| fc and Q = {w 0 : w G \A\ d }, 

respectively. Hence, the user resources are {wo : w G |A| d } and the system resources 
{w m : w G |*4| d }. We shall use a token on the user resource (wi, w 2 , ■ ■ ■, Wd)o to 
signify that the current value of B[w i, w 2 ,..., wj\ is 0; and a token on the system re¬ 
source (w i, W 2 , • ■ •, Wd)m to signify that the current value of B[wi,W 2 , ■ ■ ■, wj\ is max. 
Obviously, we have to take care to ensure that a marking does not yield contradictory 
interpretations. 

Let u G {1, 2,...,/} x |A| fc . 

Suppose that the instruction uq does not involve the array symbol B and it is 
possible for p on input A to move from any ID whose partial ID is u to an ID whose 
partial ID is v in one step. Then the transition ({u}, {v}) is in T\ (more precisely, 
the pair (u, v) is in Ti). 

Suppose that the instruction uq is of the form Xj := B[xi x ,Xi 2 ,... ,Xi d \ and it is 
possible for p on input A to move from any ID whose partial ID is u to an ID whose 
partial ID is v in one step (because the value of B[xi x ,Xi 2 ,..., x,: d ] is such that p on in¬ 
put A can move from an ID whose partial ID is u to an ID whose partial ID is v in one 
step). Then both of the transitions ({u, (it^, Ui 2 ,..., Ui d ) o}, {v, (u ^, Uj 2 ,..., Mi d )o}) 
and ({u, (u h ,u i2 ,.. {v, (u^u^,... ,u id ) m }) are in T 2 (of course, in the for¬ 

mer transition, Vj is 0, and in the latter Vj is max, with ut = Vi, for all / = 1, 2 ,..., k 
different from j ). 


13 


Suppose that the instruction u o is of the form B [, Xi 2 ,..., Xi d ] := max and it is 
possible for p on input A to move from an ID whose partial ID is u to an ID whose par¬ 
tial ID is v in one step. Then the transition ({u, (u ^, iq 2 ,..., Ui d ) m }, {v, (ui i: Ui 2 ,..., 
^2^)771}) is in r I 2 and the transition ({ U, (Wjj , Ui 2 , ■ • • , U id )o} , {v, (liq , Ui 2 ,..., Ui d ) m }) 
is in T3 (of course, in both transitions it,; = i>,, for i = 1, 2 ,..., k). 

Our initial marking of V is such that there is one token on each place of (wo : 
w € |A| d } and one token on the place (1, 0) G {1, 2,. .., /} x |A| fc , which we define to 
be C; and we define D as the place (/, max) G {1,2,...,/} x |A| fe . 

It is not difficult to see that our Petri net V (that is, our ovstructure V) can be 
described in terms of the u-structure A using quantifier-free first-order formulae (in 
which 0 and max appear: explicit descriptions of structures by quantifier-free first- 
order formulae are given in, for example, ^5)). Consequently, in order for the result to 
follow we need to show that: A |= p if, and only if, V G fib; and that fib G NPSB(l). 

Suppose that A \= p. Then there is a sequence 7r of (full, not partial) IDs starting 
at the initial ID (where all variables have the value 0, where the instruction to be 
executed is instruction 1 and where the array B has the value 0 throughout) and 
ending in a final ID (where all variables have the value max and where the instruction 
to be executed is instruction /) such that p moves from one ID in 7r to the next in one 
step. As hinted earlier, we can mirror any ID with a set of markings of our Petri net 
V as follows. If the ID consists of the partial ID u G {1,2, ...,/} x \A\ k together with 
some valuation on the array B then the place u is marked with one token as are the 
places of {w 0 : w G \A\ d , P[w] = 0}, and the places of {w m : w G \A\ d , P[w] = max} 
are marked with at least one token. This accounts for all tokens. Note that the initial 
ID of p corresponds to the initial marking of V. A simple analysis yields that if p 
on input A moves from one ID to another in one step then the Petri net can fire 
a transition to move from the marking corresponding to the first ID to a marking 
corresponding to the subsequent ID; and conversely (as remarked earlier, as regards 
the system resources, it does not matter how many tokens reside on them but only 
whether or not at least one token resides). Hence, A |= p if, and only if, V G fib- 

All that remains is to show that fib G NPSB(l). There are two essential difficulties 
in deriving a program scheme to accept fib- First, a Ub-structure V might be such 
that a reachable marking involves more than one token on some system resource; and 
we need to cater for this event when we simulate a sequence of transitions in V by 
an execution of a program scheme on input V. Second, we need to keep track of 
where tokens are in a way which avoids us modelling the fact that a token is on a 
place simply by using an array indexed by the place names; for we are not allowed 
to register that a token has moved from a place by assigning some array element the 
value 0 (recall, the only assignment instruction allowed on an array element is to set 
that element to max). 

Our Petri net V is such that initially there is one token, call it t , on the place C 
of P and there is one token on every user resource (we assume that the place C is 
indeed in P: otherwise, our program scheme simply rejects the input V). No other 
tokens are involved in the initial marking. Also, transitions are such that we can 
imagine the token t as being moved from place to place amongst the places of P, and 
we can imagine every other token either staying where it is, after some transition, or 
being moved from user resource to a system resource, and then staying where it is 
thereafter. 
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As regards our first difficulty, we do not need to actually monitor how many tokens 
lie on any system resource but only whether there is at least one token such a place. 
This obviates the need to count tokens. As regards our second difficulty, in order to 
decide whether (at least) one token lies on some system resource s, we use a dedicated 
array B\, of dimension 1, so that whenever a token is placed on such a s then -Bi[s] 
is set at max: once £?i[s] has been set to max we know that there will be a token 
on s thereafter. In order to decide whether a token lies on some user resource r, we 
use an array B 2 , of dimension 1, to register when the token originally on the place 
r is first moved from r by setting B?\r\ equal to max at this point. Consequently, if 
we wish to know whether there is a token on such a place r, we test to see whether 
B ‘2 [r] = 0 holds. Finally, we model the movement of the solitary token t by using a 
dedicated variable, x say: that is, the token t is on place p if, and only if, x has the 
value p. Given the above discussion, it is straightforward to see that the problem fIf, 
can be accepted by a program scheme of NPSB(l), and so the result follows. □ 

In essence, Theorem tells us that any problem accepted by a program scheme of 
NPSB(l) can be described by a sentence of the form 

n b [Xxil’p (x), xip Q (x), x, yV'i (x, y), x, y, z^ 2 (x, y, z), 

x, y, z, w^ 3 (x, y, z, w)](u, v), 

where: |x| = |y| = |z| = |w| = k, for some k > 1, and all variables are distinct; 1 pp, 
ipQ, ip 1 , ip 2 and 1 1>3 are quantifier-free first-order formulae over cr b U {0, max}; and 
u and v are fc-tuples of constant symbols (in fact, we can actually take u to be 0 
repeated k times and v to be max repeated k times: moreover, the sentence is such 
that whether it is true in some given structure is independent of the distinct values 
chosen for 0 and max). 

Similarly to as in Q, Theorem |f] allows us to relate the class of problems accepted 
by the program schemes of NPSB with the class of problems defined by the sentences of 
the logic (±f2f,)*[FO]. For each m > 1, we define the fragment ±fl b (m) of (±fl b )*[FO] 
as follows. 

• ±11&(1) consists of all formulae of the form 

n b [Xxtp P , xip Q , x, y0i, x, y, z^ 2 , x, y, z, w^ 3 ](u, v), 

where: ipp, ipQ, ip\, ip 2 and ip3 are quantifier-free first-order formulae over 
<Tb U {0, max}] u and v are fc-tuples of constant symbols or variables; there may 
be other free variables; and the truth of any interpretation of the formula (over 
a relevant structure and with values given for any free variables) is independent 
of the pair of distinct values chosen for 0 and max. 

• ±f2&(?n + l), for odd m > 1, consists of the universal closure of ±f2 b (m); that is, 
the set of formulae of the form VziVz 2 ... Vzkip, where ip is a formula of ±fl&(m). 

• ±S1 b{m + 1), for even m > 2, consists of the set of formulae of the form 

IIb[Ax(V>p V ~^ip 2 P ), x^q V ~^iPq), x, y (ip} V ->^), x, y, z [ip\ V -.^ 2 )- 

x,y,z,w (ipl V ~ripl)}{u,v), 
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where: ipp> ipp, ipQ, ipQi V’l) V’i; V’l) V’i) V’i and V’l are formulae of ±f 2 f,(m); 
u and v are tuples of constant symbols or variables; there may be other free 
variables; and the truth of any interpretation of the formula (over a relevant 
structure and with values given for any free variables) is independent of the 
pair of distinct values chosen for 0 and max. 


As in p 8 | , a straightforward induction yields that: 

• for every odd m > 1, every formula in the closure of iflb(m) under A, V and 3 
is logically equivalent to a formula of ±f 2 f,(m); and 


• for every even m > 1, every formula in the closure of ±Qb(m) under A, V and 
V is logically equivalent to a formula of ±Uf,(m). 

Consequently, (±f2j,)*[FO] = U{fi;,(m) : m > 1}. 

Corollary 7 In the presence of 2 built-in constant symbols, ±flb(?n) = NPSB(m), 
for each m > 1; and so (±f2;,)*[-FO] = NPSB. 


Proof We proceed by induction on m almost identically to the proof of Corollary 
10 from |3(J . The base case, when m = 1, follows by Theorem | □ 

Note that (±f2j,)*[FO] = NPSB even in the absence of our 2 built-in constant 
symbols as we can ‘build them ourselves’ using existential quantification. 

We end this section by showing that NPSB can not be realized as a fragment of 
£-oou> (unlike NPS and NPSS). 

Lemma 8 The problem CUB can be accepted by a program scheme of NPSB (1). 

Proof It was shown in |j(J that CUB is in NPSA(l): however, the program scheme 
used there to accept CUB is not in NPSB(l). Nevertheless, the basic approach can 
be amended to yield a program scheme of NPSB(l). 

Let Q be a ^-structure. We begin by ‘guessing’ a set of distinct edges in the 
graph Q. We use two 3-dimensional array symbols, B\ and B 2 , to store these guessed 
edges. In particular, if our first guessed edge is (ui,Vi), having checked that (ui,Vi) 
is indeed an edge of Q , we set f?i[0,0,ui] = max and _B 2 [0,0, Wi] = max. Next, 
we guess an edge (u 2 , v 2 ), check to see whether this edge is indeed an edge of Q 
and then check to see whether this edge is different from (ui,ui). If so then we set 
Vi, u 2 ] = max and B 2 [u\, v\, v 2 ) = max: otherwise, we set Bi[ui,vi,max] = 
max and B 2 [ui,vi,max] = max and stop guessing. We continue in this fashion until 
the guessing stage stops whence we have a list of distinct edges of Q. 

Finally, we check to see whether the guessed set of edges induces a regular subgraph 
of Q of degree 3. It is clear that this whole process can be implemented by a program 
scheme of NPSB(l): hence, the result follows. □ 

The facts that the problem CUB can not be defined in (see |29|) and that 
non-recursive problems can be defined in (see |?j ) immediately yield the following 

result. 

Corollary 9 There are problems definable in NPSB{1) (and so NPSB) which are not 
definable in and there are problems definable in which are not definable in 

NPSB. □ 
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5 Ordered structures and amended semantics 


Given our characterization, in the preceding section, of the class of problems accepted 
by the program schemes of NPSB, we now consider the class of problems accepted by 
these program schemes when we restrict ourselves to ordered structures. 

Using Theorem we can easily modify the program scheme implicit in the proof 
of Lemma || so that, in the presence of a built-in successor relation, it accepts any 
given problem in NP. Conversely, any problem in NPSB s (l) is in NP. Theorem |g] 
then yields the following result. 

Corollary 10 As classes of problems, NP = NPSB s (l); and flf, is complete for NP 
via quantifier-free first-order translations with successor. □ 

By Corollary^, NPSB s = (±12&)*[FO fl ]; and by Corollary 5.5 of Q and Corol¬ 
lary & (±Ut,)*[FO s ] = (±HP)*[FO s ], where HP is the problem over the signature 
( 72 ++ consisting of all those 02 ++ -structures A which, when considered as digraphs 
with edge relation E A and two given vertices C A and D A , are such that there 
is a Hamiltonian path from C A to D A . Furthermore, by Corollary 3.2.2 of |f26| , 
(±HP)*[FO s ] = L np (the class or problems accepted by a log-space deterministic or¬ 
acle Turing machine with access to an NP oracle), and every problem in (±HP)*[FO s ] 
can be defined by a sentence of the form: 

3z x 3z2 ... 3z m (HP[Ax, yip(x, y, z)](0, max) A ->HP[Ax, y<p(x, y, z)](0, max)), 

where: x and y are fc-tuples of variables, for some k; ip and ip are quantifier-free first- 
order formulae (with successor); and 0 (resp. max) is the constant symbol 0 (resp. 
max) repeated k times. Hence, translating this normal form into a program scheme 
yields that any problem in NPSB s can actually be accepted by a program scheme of 
NPSB s (3). Furthermore, any problem accepted by a program scheme V 21 V 22 • • • 
of NPSB s (2) can be accepted by a program scheme of NPSB s (l): we simply replace 
the universal quantification by code within a program scheme of NPSB s (l) which 
uses a while instruction and the successor relation to check whether a structure is 
accepted by p for every valuation of the free variables z\, Z 2 , ■ ■ ■, z m . Hence, we have 
the following result. 

Theorem 11 NPSB s ( 1) = NPSB s { 2) = NP and NPSB s { 3) = NPSB s = L NP . □ 

Let us now amend our semantics of the program schemes of NPSB. When we 
defined the semantics of a program scheme p of NSPB(2?' + 1), for some i > 0, we 
insisted that when a test in some if-then-else or while instruction is evaluated (recall, 
such a test is a program scheme of NSPB(2i)), the only values used in this evaluation 
are the current values of the variables of p. In particular, all arrays involved in 
the evaluation are initialized to 0 prior to the evaluation. Suppose that we now 
insist that arrays used in the evaluation are initialized to their current values prior 
to the evaluation. Consequently, not only can we pass the current values of the 
variables across to an evaluation, we can pass the current values of the arrays across 
too (or course, when the program scheme p resumes after evaluation of the test, the 
values of the arrays are what they were prior to the evaluation of the test). We 
denote the program schemes of NPSB with this semantics as NPSB p to reflect the 
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fact that a polynomial number of values is passed across in an evaluation (rather than 
just a constant number in the standard semantics). Allowing a polynomial number 
of values to be passed across to an evaluation drastically changes the expressibility 
of the resulting class of program schemes (modulo the usual complexity-theoretic 
qualifications). The complexity class PH is the Polynomial Hierarchy ; that is, PH = 
U^E?, where E p = NP and where, for each i > 2, E p = NP 5 ^- 1 (the class of 
problems accepted by a polynomial-time non-deterministic oracle Turing machine 
with access to a E p _ x oracle). 

Theorem 12 NPSB P ( 1) = NPSB( 1), NPSB P ( 2) = NPSB( 2) and for every i > 2, 
NPSB p (2i - 1) = NPSB p (2i) = E p . Consequently, NPSB P = PH. 

Proof Similarly to the proof (elucidated immediately prior to Theorem that 
NPSB s (l) = NPSB s (2), so we can show that NPSB p (2z — 1) = NPSB p (2z), for all 
z > 2. Obviously, NPSB P (1) = NPSB(l) and NPSB P (2) = NPSB(2) (as our original 
semantics and our amended semantics do not differ in these cases). 

We now show how to build our own successor relation using a program scheme of 
NPSB P (3). Essentially, we guess a successor relation and store it in the array S , of 
dimension 2 , via the following code: 

x : = 0 

while x ^ max do 
guess y 
if a ' ^ y then 
S[x,y] := max 
x := y 
fi 
od 

Then we check, using an if-then-else instruction with the test a program scheme of 
NPSB P (2), that every value appears in the guessed relation S and that no value 
appears more than once. Consequently, by Corollary |Io], any problem in NP can be 
accepted by some program scheme of NPSB P (3). 

Not withstanding the above remark, we would like to explicitly simulate a non- 
deterministic polynomial-time Turing machine computation using a program scheme 
of NPSB P (3). We can use arrays to store the work-tape of any such Turing machine 
and our successor relation, held in S , to mirror the movement of the tape heads. Our 
only restriction to this simulation is that we can only set array values at max: we 
can not reset them to 0. Hence, the obvious means of simulation is doomed to failure 
given that, in general, the contents of a cell of a Turing machine work-tape fluctuate 
and that if we simulate a cell of the work tape using a fixed number of array elements 
then we can only register a constant number of changes to the cell contents. However, 
we can get round this difficulty by using the fact that any (accepting) computation of 
our Turing machine has length polynomial in the size of the input structure: hence, 
we can use an array to store the complete history of changes to the contents of a 
Turing machine work-tape cell as follows. 

For simplicity, assume that we wish to hold the contents of n Turing machine 
work-tape cells (where the input structure has size n) using some arrays and that 


18 


these contents are only ever 0 or 1. Furthermore, assume that the time taken by our 
Turing machine to accept (if it does) is n. The general case where a cell can contain 
more symbols, where there is a polynomial number of work-tape cells to deal with and 
where the Turing machine accepts in a polynomial number of steps can be handled 
similarly by increasing the dimensions of our arrays. Let A and B be array symbols 
of dimension 2. Using our successor relation (constructed earlier), we use the array 
cells A[u, 1], A[u, 2 ],..., A[u, n] (we think of the elements of our input structure as 
being named {1, 2 ,..., n} with the names reflecting our successor relation) to register 
the first change of the contents of the work-tape cell n, the second change of the 
work-tape cell u, the third change of the work-tape cell u, and so on; and the array 
cells B[u, 1], B[u, 2],..., B[u, n ] to register the value of work-tape cell u after the first 
change, the value of work-tape cell u after the second change, the value of work-tape 
cell u after the third change, and so on. 

If A[u,i] = max then this is interpreted as meaning that there have been at least 
i changes of contents; and if B[u,i\ = 0 (resp. B[u,i] = max ) then this is interpreted 
as meaning that after the ith change, the contents of work-tape cell u is 0 (resp. 1). 
Note that when the work-tape cell u changes from 1 to 0, on the ith change, say, in 
order to register this change we need only set A[u,*] = max and leave B[u,i] alone 
(as it has been initialized to 0). Furthermore, with this representation, and using 
our successor relation, we can easily determine the current contents of any work-tape 
cell: we simply cycle down the array A to find the last change of contents and then 
ascertain the current contents using B. Thus, it should be clear how we can explicitly 
simulate our Turing machine computation using a program scheme of NPSB P (3). 

Now, consider a polynomial-time non-deterministic oracle Turing machine M con¬ 
sulting an NP oracle. By Corollary [l(], and using an array to hold the contents of 
the oracle tape, we can simulate an oracle call of M by an if-then-else instruction 
where the test is a program scheme of NPSB P (2) (exactly because we are allowed, in 
our modified semantics, to pass the values of arrays over to the evaluation of a test). 
Hence, we have essentially proven that any problem in NP np can be accepted by a 
program scheme of NPSB P (3). Conversely, it is straightforward to see that any prob¬ 
lem accepted by a program scheme of NPSB P (3) can be accepted by a polynomial-time 
non-deterministic oracle Turing machine with an oracle in NP (the only point worthy 
of note in this regard is that we must ensure that the contents of all arrays in the 
program scheme are written on the simulating Turing machine’s oracle tape). Hence, 
NPSB P (3) = NP np . 

The general result now follows by a simple induction: for example, any polynomial¬ 
time non-deterministic oracle Turing machine consulting an oracle in NP np can be 
explicitly simulated; and by above the oracle calls can be simulated by if-then-else 
instructions where the tests are program schemes from NPSB P (4). □ 

6 Some relative computational capabilities 

We now turn to the relative computational capabilities of the classes of program 
schemes NPSB(l) and NPSA(l) on the class of all finite structures (we have more to 
say about comparing the classes NPSB and NPSA in the Conclusion). 

The following definitions are essential to what follows. Let a be some signature 
and let A and B be cr-structures. If |A| C \B\ and: 
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• for every relation symbol R of a, R A is R B restricted to \A\; and 

• for every constant symbol C of a, C A = C B , 

then we say that A is a sub-structure of B and write A C B. If the problem f l over a 
is such that for all cr-structures A and B for which A C B, it is necessarily the case 
that A £ Cl implies B £ Cl, then we say that f 1 is closed under extensions. Let EXT 
be the class of all problems that are closed under extensions. 

Lemma 13 Every problem in NPSA( 1) is closed under extensions. 


Proof Let 11 be a problem over the signature a accepted by the program scheme p 
of NPSA(l). Let A and B be cr-structures such that AC B, and suppose that A\= p. 
Consider the program scheme p on input B where 0 and max are chosen to be distinct 
elements of |A|. By ‘mirroring’ an accepting computation of p on input A, with the 
chosen 0 and max , we obtain an accepting computation of p on input B (the fact that 
all tests in while, if and if-then-else instructions are quantifier-free first-order enables 
us to do this). Hence, B £ Cl. □ 

Theorem 14 PSPACE D EXT = NPSA( 1) and NP n EXT = NPSB{ 1). 


Proof Let Cl be some problem in PSPACE Cl EXT. By |24|], there exists a program 
scheme p £ NPSA s (l) accepting Cl. Modify p to obtain the program scheme p' £ 
NPSA(l) as follows. In p', begin by guessing a successor relation; that is, when A is 
some input structure, guess elements u±,U 2 , ■ ■ ■, u m £ |M| so that 

M[ 0] = Mi, M[ui] = it 2 , ■ • •, M[u m ] = max , 

where M is a new one-dimensional array symbol and where the elements of {0, mi, M 2 , 
..., u ra , max} are distinct (this latter condition can be checked as we guess). Replace 
any atomic relation of the form succ{x, y) in p with the formula y = M[x\ , and replace 
any instruction of the form guess x with the following fragment of code: 

guess x 
goodx : = 0 
ok := 0 

while ok = 0 do 

if tx = goodx V goodx = max) then 
ok : = max 

else 

goodx := M [goodx] 

fi 

od 

if I ^ goodx then ‘loop forever’ fi 

(where goodx and ok are new variables). Note that this fragment of code essentially 
limits our guesses to elements appearing in the domain of our guessed successor rela¬ 
tion. We need to show that acceptance by the program scheme p' is invariant with 
respect to 0 and max and that it accepts the problem Cl. 
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Suppose that A £ fi. Then A is accepted by p no matter which successor relation 
is chosen for succ in p. Choose distinct O' and max' in |A| and a successor relation 
succ' on |A| (with minimal and maximal elements the chosen elements O' and max'). 
In particular, p accepts A with these constants and this successor relation. Consider 
a computation of p' on input A where the guessed successor relation is sued. Then 
there exists a computation of p' mirroring any accepting computation of p on input 
A with this particular successor relation. That is, A is accepted by p' and acceptance 
does not depend upon the chosen constants 0 and max. 

Conversely, suppose that there is a guessed successor relation, call it sued (whose 
domain need not be all of |A|), with minimal and maximal elements 0 / and max ', 
yielding an accepting computation of p' on input A. Let B C |„4| be the domain of 
this successor relation and let B be the restriction of A to B. Then B is accepted 
by p when the successor relation is taken as sued (note that the domain of sued 
is the whole of \B\). Hence, B £ H. However, il is closed under extensions and so 
A £ fi. But we have seen from above that if A £ H then A is accepted by p' and 
acceptance does not depend upon the chosen constants 0 and max. Thus, acceptance 
by p' is invariant with respect to 0 and max ; and PSPACE £]£XT C NPSA(l). The 
fact that every problem in NPSA(l) can be solved by a polynomial-space algorithm 
is straight-forward; and every problem in NPSA(l) is closed under extensions by 
Lemma m 

Now consider a problem H £ NP n £ XT accepted by the program scheme p £ 
NPSB s (l). We proceed as above, and define a program scheme p' £ NPSB(l), except 
with the following amendment. In NPSB(l), we are only allowed assignments to array 
elements of the form M[x i, £ 2 , • • •, Xk] ■ = max and so we need some way of encoding 
our guessed successor relation. We encode our relation as: 

M[ 0, Mi] = max , M[u\, U 2 ] = max ,..., M[u m , max] = max, 

where M is a new array symbol of dimension 2. Of course, we ensure that the elements 
of {0, zti, U 2 , • • •, u m , max} are distinct as we guess. Note that we need to remember 
the previously guessed element, Ui, so that we know to set M[zij,M i+1 ] equal to max. 
We also need to modify our code so that an atomic relation of the form succ(x, y) is 
replaced by the formula M[x,y] = max. Arguing as above yields the result. □ 

One view of Theorem [lj is that it provides syntactic characterizations (via the 
the classes of program schemes NPSA(l) and NPSB(l)) of semantically defined com¬ 
plexity classes (namely, PSPACE n EXT and NP fl EXT). 

Corollary 15 NP = PSPACE if, and only if, NPSA( 1) = NPSB( 1). 

Proof If NP = PSPACE then NPn EXT = PSPACEn£AT; and so NPSA(l) = 
NPSB(l) by Theorem |lj. Conversely, if NPSA(l) = NPSB(l) then NPSA s (l) = 
NPSB s (l); and so NP = PSPACE by [|l| and Corollary [l(| □ 

Corollary [l^ is somewhat surprising given that every problem in NPSA (and so 
NPSB) has a zero-one law. In fact, any problem in EXT, apart from the empty prob¬ 
lem (over some signature), has a ‘ 1-law’ (the meaning of ‘1-law’ should be obvious); 
and so any non-trivial problem in NPSA(l) (and NPSB(l)) has a 1-law. Note that 
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any class of problems each of which has a zero-one law can not contain, for example, 
the computationally trivial problem consisting of all those structures of even size. 

Corollary [l 5 | can be extended slightly in that we can obtain some additional equiv¬ 
alences involving fragments of certain vectorized Lindstrom logics. Referring back to 
Theorem the problem mentioned in that theorem is actually defined as follows. 

Definition 16 Let the signature a a = (Ti,Ts,M,C), where M is a unary relation 
symbol, Tf is a binary relation symbol, X3 is a relation symbol of arity 4 and C is a 
constant symbol. We can envisage a <r a -structure A as a Petri net whose places are 
given by \A\ and whose transitions are given by Xi and X2 via: 

• there is a transition ({a;}, {j/}) whose input place is {x} and whose output place 
is {y} if, and only if, T\{x,y) holds; and 

• there is a transition ({xi, X2}, {yi, 2/2}) whose input places are {xi,X2} and 
whose output places are {221,2/2} if, and only if, T%(x\, X2, 2/1, 2/2) holds, where 
xi / x 2 and yi ^ y 2 - 

The relation M can be seen as providing an initial marking (with one token on place p 
if, and only if, M ( p ) holds) and the constant C as providing a final marking (consisting 
of one token on the place C). 

A er a -structure A, i.e., a Petri net, complete with inital and final markings, where 
every transition has either 2 input places and 2 output places or 1 input place and 1 
output place, is in the problem fl a if, and only if, there is a marking covering the final 
marking that is reachable from the initial marking, i.e., there is a reachable marking 
in which there is at least one token on the place C. □ 


Corollary 17 The following are equivalent. 

(a) NP = PSPACE. 

(b) NPSB{ 1) = NPSA( 1). 

(c) nl[FO]=n 1 a [FO]. 

( d ) The problems fib and fl a are equivalent via quantifier-free first-order translations 
with 2 constants. 


Proof Corollary |l 5 | implies (a) <t=> ( b ). Theorems |] and |] imply ( b ) <t=> (d). It is 
trivially the case that (c) => (a) and that (d) => (c). □ 

We end by returning to an earlier remark concerning the NPSB hierarchy on the 
class of all finite structures. We include the following result here as we can utilize 
results of this section, and this result also applies to the NPSA hierarchy. 

Proposition 18 On the class of all finite structures, 

NPSB( 1) c NPSB(2) c NPSB(3). 
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Proof By Lemma 0 every problem in NPSB(l) is closed under extensions; and so, 
trivially, NPSB(l) C NPSB(2). 

Consider the following first-order sentence over the signature <72 = (E, C ), where 
E is a binary relation symbol and C is a constant symbol: 

3 x(E(C, x ) A 3 y(E(x, y) A \/z(E(x, z) => z = y))). 

There is clearly a program scheme of NPSB(3) accepting the problem f 1 defined by this 
sentence. For any k > 1, consider the digraphs, Ak and Bk, depicted in Fig. 2 (note 
that Bk only differs from Ak by having an extra vertex and edge). No matter what 
the value of k, Ak S kl but Bk ^ f2. We shall show that for any program scheme p of 
NPSB(2), there exists some k such that Ak \= p implies that Bk |= p■ This will yield 
our result. 



◄- k copies-► ◄- k copies-► 

Figure ||. The digraphs Ak and Bk- 


Let p be a program scheme of NPSB(2) of the form MxkVx 2 .. -Vxkp', for some 
program scheme p' of NPSB(l), and let Bk be an extension of Bk by k constants 
(one for each variable Xi). There is an extension of Ak , denote it Ak, such that Ak is 
embeddable into Bk via a one-to-one mapping: call the mapping 7 r. Suppose that there 
is an accepting computation of p' on input Ak- We can ‘mirror’ this computation by a 
computation of p' on Bk by making guesses according to the mapping 7r (after having 
chosen our constants 0 and max , again according to 7r). The two computations of p ', 
on Ak and Bk, proceed in tandem (in that their flows of control are identical) and 
because the computation of p' on Ak leads to acceptance, so must the computation 
of p' on Bk (recall, any tests are quantifier-free first-order and so only ever refer to 
the current values of variables). Our result follows. □ 

We add that the proof of Proposition [l^ suffices to show that, on the class of all 
finite structures, NPSA(l) C NPSA(2) C NPSA(3). 

7 Conclusions 

In this paper, we have examined the computational capabilities of different classes of 
program schemes, based around ‘binary write-once arrays’, on the class of finite struc¬ 
tures, the class of ordered finite structures and with respect to different semantics. 
We now discuss some potential directions for future research. 

Perhaps the most obvious unanswered question is as regards the NPSB hierarchy: 
‘Is it the case that, like the NPS and NPSS hierarchies, the NPSB hierarchy is proper 
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at every level?’ (the same question can be asked for the NPSA hierarchy). So far, 
we have not been able to answer this question (beyond Proposition ]j±|). The main 
reason for the lack of progress is that whereas in Q we were able to ‘re-use’ domain 
elements so as to ‘mirror’ computations of program scheme of NPS and NPSS (in 
the style of the proof of Proposition ^8[), the existence of arrays means that we can 
‘remember the values already used’ in a computation and consequently it is not clear 
that domain elements can be re-used in a suitably anonymous fashion (the reader is 
referred to Q], and the proofs therein, in order to make more sense of this remark). 
The fact that working with program schemes of NPSB takes us outside the ‘bounded- 
variable world’ of the logic (see Corollary ||), whereas this is not thecase with 
the program schemes of NPS and NPSS, is particularly intriguing in this respect. 

The results in Section 6, relating the computational capabilities of the classes of 
program schemes NPSB(l) and NPSA(l), are in the style of Abiteboul and Vianu 
[jl( |[, Abiteboul, Vianu and Vardi Q and Dawar Q. However, we would prefer 
to have determined similar results but regarding the classes NPSB and NPSA (or, 
equivalently, the logics (±Hf,)*[FO] and (±H a )*[FO]). So far, we have been unable to 
extend the results of Section 6 to these classes of programs schemes. There are some 
very straightforward implications to be made however. For instance (on the class of 
all finite structures): 

• by Corollary |l7|, if NP = PSPACE then NPSB = NPSA (and, equivalently, 
(±fi 6 )*[FO] = (±n Q )*[FO]); 

• by Theorems Cl and 111], if NPSB = NPSA (or, equivalently, (±170* [FO] = 
(±O0* [FO]) then L^= PSPACE; and 

• by Theorems [f] and [ll], if S2j)[FO] = 12* [FO] then NP = PSPACE (as any 
problem in I2j)[FO] can easily be seen to be in NP). 

We would like to be able to equate the questions: ‘Is L NP equal to PSPACE?’, ‘Is 
NPSB equal to NPSA?’ and ‘Is (±I2b)*[FO] equal to (±I2 Q )*[FO]?’; as well as the 
questions: ‘Is NP equal to PSPACE?’ and ‘Is I2j)[FO] equal to 12* [FO]?’. As yet, we 
have been unable to do so. 

Finally, let us return to the decision problem described at the beginning of Section 
4 involving the traversal of a digraph subject to the utilization of user and system 
resources. We feel that this problem, and its variations, are very relevant in the 
study of the complexity of agent-based, systems. Essentially, an agent-based system is 
an environment within which an agent must successfully accomplish a task. Agents 
interact with the environment by performing actions and these actions can result in 
a change of state of the environment. The reader is referred to (d|] for some basic 
definitions and complexity-theoretic results in agent-based systems. Our resource- 
dependent digraph traversal problem can easily be viewed as an agent-based system, 
and we intend to investigate exactly how the study of program schemes and logics 
can impact upon that of agent-based systems in a future paper. 
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